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Abstract 

A new background rejection strategy for 7-ray astrophysics with stereoscopic Imaging 
Atmospheric Cherenkov Telescopes (lACT), based on Monte Carlo (MC) simulations 
and real background data from the H.E.S.S. [High Energy Stereoscopic System, see 
experiment, is described. The analysis is based on a multivariate combination 
of both previously-known and newly-derived discriminant variables using the physical 
shower properties, as well as its multiple images, for a total of eight variables. Two 
of these new variables are defined thanks to a new energy evaluation procedure, which 
is also presented here. The method allows an enhanced sensitivity with the current 
generation of ground-based Cherenkov telescopes to be achieved, and at the same time 
its main features of rapidity and flexibility allow an easy generalization to any type of 
lACT. The robustness against Night Sky Background (NSB) variations of this approach 
is tested with MC simulated events. The overall consistency of the analysis chain has 
been checked by comparison of the real 7-ray signal obtained from H.E.S.S. observations 
with MC simulations and through reconstruction of known source spectra. Finally, the 
performance has been evaluated by application to faint H.E.S.S. sources. The gain in 
sensitivity as compared to the best standard HiUas analysis ranges approximately from 
1.2 to 1.8 depending on the source characteristics, which corresponds to an economy in 
observation time of a factor 1.4 to 3.2. 

Keywords: 7/hadron discrimination; weak 7-ray sources; Imaging Atmospheric 
Cherenkov Telescopes; multivariate analysis; boosted decision trees. 



1. Introduction 

Very High Energy (VHE) 7-rays interact in the Earth's atmosphere, giving rise to 
highly relativistic showers which produce a few-nanosecond flash of Cherenkov photons 
in a "light pool" of ~ 250 m diameter. This can be detected by one or more ground-based 
telescopes, which collect this light using a mirror that reflects it onto a camera composed 
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by an array of photomultipliers, converting the photons to a charge per pixeljj and thus 
giving an image of the shower as seen by each telescope. 7-ray induced atmospheric 
showers have a rotational symmetry around the shower axis (neglecting geomagnetic 
effects), which results in elliptical images on the cameras. By combining the informa- 
tion from all the shower images seen, the arrival direction and the energy of the 7-ray 
can be reconstructed by well-established methods j2|, [sj . The 7-ray signal is, however, 
drowned out by an enormous number of background air showers initiated by charged 
hadronic cosmic-rays. Contrarily to the 7-rays, the hadronic component of air show- 
ers is characterized by an asymmetric development of the particle cascade due to the 
hadronic interactions, which have large transverse momenta, and due to longer-lived 
muons reaching the telescope altitude. 

After decades of research jif, great progress has been made in the field of VHE 
7-ray astronomy in recent years, with the advent of the new generation of lACTs. The 
major keys to this success have been stereoscopy - offering a three-dimensional view of 
the atmospheric shower - and fast imaging cameras characterized both by their time 
response at the nanosecond scale and by a fine pixel granularity. These features have 
led to larger effective detection better angular resolution and a much improved 

discrimination power against the abundant cosmic-ray background. Nonetheless, the 
detection using standard analysis methods of sources at the level of 1% of the Crab 
nebula flux requires several tens of hours with arrays such as H.E.S.S. or VERITAS, even 
at their optimal sensitivity, out of ~ 1000 h/year of available good- weather moonless 
nights. This observation time needed for detection of very low-fiux sources, for the 
discovery of new classes of 7-ray emitting sources, or for detailed morphological studies 
of extended sources can even become prohibitively long if mirror ageing occurs, causing 
an increase in threshold energy and thus a decrease in sensitivity. 

Long experience has demonstrated the robustness against changing data-taking con- 
ditions of the standard Hillas-parameter based analysis (see and also a short descrip- 
tion in Sec. [2]), essentially because of its stability under variable conditions, allowing its 
application to the entire set of lACTs world-wide since the 1980 's. However, this kind of 
analysis does not exploit the correlation between images of the same shower as seen by 
different telescopes in a simple way, nor take advantage of fine-pixelization imaging. So, 
in recent years, additional methods have been developed in order to enhance the analysis 
sensitivity by fitting a model of the Cherenkov photons produced by a 7-ray shower to 
the images seen by each telescope, overcoming the above-mentioned limitations. 

A first successful development was carried out over a decade ago for the CAT exper- 
iment in mono-telescope mode (gI with the 2D-model, where the image of the shower 
(in particular its longitudinal profile but also its transverse profile) is used to predict 
the angular origin of the event along the image axis, allowing the event direction on the 



^For example, in the first phase of the H.E.S.S. experiment, the reflective mirror area is 107 m^ and 
the pixel size is 0.16°, which is quite well adapted to the shower image size, with four such telescopes 
spaced by 120 m making up the stereoscopic array, see 
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sky to be obtained, as well as giving a x^-based discrimination variable for background 
rejection. This method has subsequently been improved (passing from a minimiza- 
tion to a maximum-likelihood, among other developments), and has been optimized to 
take full advantage of the stereoscopy provided by H.E.S.S. (see In addition, a 

three-dimensional (3D-model) reconstruction method has been introduced in H.E.S.S. 
[3], where the result of the fit is a set of parameters describing the shower characteris- 
tics and where the selection analysis cuts are defined on the physical shower properties, 
rather than on the likelihood value. An upgraded version of the 3D-model analysis in- 
troduced a cut based on a value in order to include the information from untriggered 
telescopes, see j^. More recently, two different types of multivariate analyses (MVA) 
have been applied for H.E.S.S.: the first using Hillas-parameter based discriminant vari- 
ables [9!]; the second using discrimination based on Hillas-parameter based variables, 
plus 3D-model parameters and 2D-model goodness-of-fit [10]. 

The new method presented in this paper is based on an MVA approach optimized 
for the detection of low-flux sources, in which new ideas have also been implemented. 
First of all, the decision was taken to explore only discriminant parameters which are 
not a goodness-of-fit value, principally so as to lower the sensitivity of the analysis to 
uncertainties due to NSB variations. Indeed, in order for the systematics due to the NSB 
to be under control, goodness-of-fit based discrimination analyses need a rather precise 
knowledge of the underlying probability distribution (for likelihood fitsj§ of the pixel 
response to 7-ray showers and noise, together with its variation as a function of the NSB 
rate (see for example Sec. 3.7 of j3] and the definition of [Gj]). These distributions may 
also be sensitive to imperfections of the electronics or the acquisition system and their 
drift over time, which might be difficult to reproduc^ or time-costly to implement with 
MC simulations of the instrument. Avoiding such types of parameters should provide a 
more "portable" method, applicable to many current and future lACTs. Secondly, the 
information carried by the 3D-model about the predicted shape of 7-ray images on the 
cameras is found to provide additional discrimination power when the Hillas moments 
of the predicted images are calculated and used to derive new discriminant variables. 
Thirdly, an energy reconstruction method has been developed explicitly for this work, 
the conception of which has allowed the definition of two new discriminant parameters. 
These parameters, combined with a third new one - all of which are detailed below - 
provide additional power for hadron rejection. Special attention has been paid in this 
work to the cut optimization for different expected source characteristics (intensity and 
spectral hardness), as well as for studies of source morphology. 

After a brief overview of the choices made for stereo reconstruction (Sec. [2]) a versa- 
tile 7-ray energy reconstruction algorithm will be presented in Sec. [31 which is used as 
the basis for two newly-developed parameters. The parameters chosen for the 7/hadron 
discrimination are introduced in Sec. HI while the details of the multivariate event clas- 



^Or its error in case of a fit. 

•^Some years of operation experience have been found to be necessary to understand and characterize 



such effects. 
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sification method are presented in Sec. [51 The stabihty of the method is tested through 
comparison of MC simulations and real 7-ray data in Sec. El Finally, in Sec. [71 the 
performance is evaluated through application to H.E.S.S. sources. 

2. Stereo reconstruction 

The steps of the shower reconstruction process used for the present work are de- 
scribed here. First of all, the images triggered by the event in the telescopes are cleaned 
so as to eliminate the pixels not containing a significant fraction of the Cherenkov photon 
signal (comparable to the NSB fluctuations) using the methodology described in [3|0. 
The charges of the selected pixels are used to calculate the first and second moments of 
the images in each telescope, allowing the "Hillas parameters" to be derived (equivalent 
to fitting an ellipse to each image). A Hillas ellipse is characterized by its length and 
width, as well as by its position with respect to the camera centre, by the total charge in 
photo-electrons (p.e.), and by its main axis direction. Images whose centres-of-gravity 
fall near the edge of the cameras are discarded so as to minimize shower parameter mis- 
reconstruction due to image truncation; all the remaining images passing a pre-defined 
charge threshold are used to calculate the shower arrival direction and the intersection 
of the shower axis with the ground (also known as impact position), see (2!, [sf. The 
geometry of the stereoscopic images allows the slant depth of the shower maximum to 
be estimated, assuming the centre of gravity of the images to be the projection of the 
maximum into the image plane. 

The shower characteristics derived from the Hillas-parameter based variables are 
useful as such, but also as the first guess for the subsequent 3D-model optimization 
j^. In the 3D- model, the 7-ray shower is modelled by a 3D-photosphere of Cherenkov 
photon origins in the atmosphere which is assumed to have a Gaussian distribution along 
all axes. This photosphere is described by eight parameters which are intrinsic to the 
shower: the direction and the impact position of the incident 7-ray, the photosphere 
length (along the shower axis) and transverse width (rotational symmetry around the 
shower axis being assumed), the slant depth of the shower maximum, and the number of 
Cherenkov photons generated by the electrons and positrons in the shower development. 
The 3D-model is used to predict the distribution of Cherenkov light in the cameras of 
a telescope array; then the intrinsic parameters of the shower are adjusted to achieve 
a good fit (using a maximum-likelihood optimization) of the ensemble of 7-ray shower 
images. 

Given the Hillas and the 3D-model reconstruction procedures, two estimations of 
the event direction are available. The 3D-model point- spread-function (PSF) is better 
than that provided by the Hillas-parameter based reconstruction for energies below 
500 GeV, whereas the inverse applies for higher energies. In this work, the Hillas- 



The two-threshold image cleaning used in this work is of type 5-10 p.e.; and a pixel is also rejected 
if its charge is less than Scr above its pedestal level. This is the case for the standard H.E.S.S. analysis. 
Other lACTs use thresholds which are adapted to their instruments. 
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parameter based direction has been chosen for simphcity of implementation. Studies of 
the use of optimized combination of the two reconstructed directions will be undertaken 
for a future upgrade. 



3. Energy reconstruction 

In ground-based 7-ray astronomy, sources are observed under various conditions of 
zenith angle, under which the images produced by 7-ray showers propagating in the 
atmosphere vary markedly, and the shower images also vary depending on the impact 
parameter of the shower with respect to the given telescope position fsee for example 
the two-dimensional image profiles for such varying conditions in In addition, 

sources may be observed at different offsets with respect to the centre of the camera, 
resulting in different detector responses. Moreover, any loss of photon collection or 
detection efficiency can play an important role in the evaluation of the total image 
charge amplitude. Therefore, the data analysis in this field is characterised by a wide 
use of lookup tables, in which the 7-ray image characteristics as a function of the varying 
data taking conditions are stored to allow the modelling of the detector response to signal 
events. 



The energy of the 7-ray shower can be reconstructed by using such lookup tables 
generated either with the Hillas-parameter based information, or with that provided by 
the 3D-model. However, the method for the evaluation of the energy with the 3D-model 
requires the convergence of the model fit, which is not guaranteed, especially for low- 
charge two-telescope events. As a generic method usable for all analysis configurations 
was deemed preferable, the choice was made to work in the Hillas-parameter based 
reconstruction framework. The proposed method of calculating the 7-ray energy has the 
advantage of being flexible, because the definition of the charge profiles can be performed 
with any kind of 7-ray simulation, and moreover, the energy evaluation phase is relati vely 



fast with respect to other alternative methods, such as those requiring minimization [12 
The energy reconstruction method presented here is composed of two separate phases: 
a preparatory phase and an evaluation phase. 

3.1. Preparatory phase 

The preparatory phase requires the generation of lookup tables {charge profiles, see 
for example Fig. [T^) of the sum of the charges measured in a simulated 7-ray cleaned 
image as a function of the impact parameter (referred to below as R). These are created 
for discrete sets of values of the simulated energy -Etrue, of zenith angles G, of values of 
the mirror efficiency M , of offsets D of the shower angular origin with respect to the 
centre of the camera, and for multiple bands of reconstructed shower maximum depths 
H (as estimated from the Hillas parameters of the images). Note that these charge 
profiles are related to the Cherenkov light profile on the ground, but as detected in the 
images seen by the telescopes (including the photomultiplier efficiency with wavelength 
and image cleaning effects, etc.). 
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Figure 1: (a) Profiles of tlie ctiarges of ttie cleaned images in tlie cameras as a function of the 
reconstructed impact parameters at tliree different observing angles 0° (points), 32° (squares) 
and 41° (triangles), for a given energy, shower maximum, offset and mirror efficiency; the 
error bars shown represent the spread of the distributions, (b) Schematic of the tree used 
for the evaluation of the event energy for each telescope. For an event having a zenith angle 
©reco G [©1,02]! an offset Dj-eco £ [Di, D2], a maximum depth i/reco £ [Hi,H2], an impact 
parameter i?reco £ [-^ii -^2], and being observed in a telescope having a mean mirror efficiency 
Mxeii G [Mi,M2], the logiQ Ej and the weight Wj are calculated for each branch of the tree 
and then the different values are combined to evaluate the event energy per telescope -Exeii- 



3.2. Energy evaluation phase 

The energy evaluation phase consists of the use of these pre-generated charge profiles 
to calculate the event energy based on a weighted combination of the estimates of the 
energy in each telescope. The strategy for the evaluation of the energy in each telescope 
can be easily visualized by a tree, see Fig. [T]d. For an event having a triggered telescope 
Telj with an associated mirror efficiency Mxei^ and a measured charge qTeU, there are four 
reconstructed parameters: the zenith angle Oreco, the offset Z^reco, the maximum depth 
i^reco and the impact parameter with respect to the position of the telescope -Rreco- The 
weight of each branch of the tree Wj can be calculated, where Wj is given by: 

5 

Wj = JJ^Pfc where pk = Wk or pk = (1 — Wk) depending on the branch taken, (1) 

fc=i 

and where the weights Wk are given by the interpolation of the nearest simulated values. 
As an example, for an event having a reconstructed zenith angle Greco the algorithm 
searches for the two fixed values [0i, Q2] between which to interpolate such that Orcco G 
[9i, 62], and applies a weight of wq = (Oreco — 0i)/(02 — ©i)- For each branch of the 
tree, given the value of the measured charge gTei^, the event energy associated with the 
telescope can be retrieved using the charge profiles mentioned in Sec. 13.11 which were 
defined for a set of energy values. The energy in each triggered telescope can thus be 
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Figure 2: Given the distribution of log{Ereco / Etme) , where Ereco and -Etrue are the recon- 
structed and simulated energy values, its standard deviation defines the energy resolution (a), 
and the mean value defines the bias (b). The two curves refer to two analysis configurations 
(40 p.e. and 150 p.e., dashed and continuous line respectively, see Sec. [57 



easily calculated by a weighted mean on all the energy values and weights obtained from 
the 2^ = 32 branches: 

logio ^Tel, = ^32 ... (2) 



The energies -ETeii calculated separately for each triggered telescope are combined in the 
final step with a charge-weighted mean: 



Ereco = ^ ^ [qTeU " -^TelJ , wherC Qtot = ^ ^Tel, 



(3) 



i=l 



Qtot being the total charge of the event. The resolution and bias of the current imple- 
mentation of the algorithm can be seen in Fig. |2l in the energy range 0.2-30 TeV the 
resolution is almost stable at ~ 15%, while the bias is better than 5% for E > 0.3 TeV. 



4. Selected 7/hadron discriminant parameters 

The definitions of the discriminant parameters take advantage of the information 
carried by the two different stereo reconstruction algorithms discussed in Sec. [2] (the 
3D-model and Hillas-parameter based ones) and of the energy reconstruction algorithm 
introduced in Sec. 13.21 As will be mentioned in Sec. 15. 3[ the parameters have been 
chosen according to their discriminant power and in order to avoid strongly-correlated 
variables. 
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Figure 3: 3D-width (a) and 3D- width-err (b) for the H.E.S.S. array and for energies in the 
range 1-2 TeV. The 3D-depth (c) is shown for the range 100-300 GeV. All three parameters 
are shown for the 10-20° zenith angle range. Filled and hatched distributions represent the 
7-ray simulated events and the measured cosmic-ray background, respectively. 



4.1. Hillas-parameter based discriminant variables 

For a given reconstructed shower, two powerful discriminant parameters commonly 
used for stereoscopic arrays can be defined: a Mean Scaled Length (MSCL) and a 
Mean Scaled Width (MSCW). As described in Q, lookup tables are prepared based 
on simulations, which contain the mean width and length and their respective rms 
scatter for a 7-ray as a function of the amplitude of the shower image in the camera 
and the impact parameter relative to the telescope. The MSCL and MSCW are then 
calculated as the mean values of the deviations of the measured lengths and widths 
from the predicted mean values, divided by their rms. In short, these mean scaled 
parameters indicate deviations of the shape of the detected images compared to those 
resulting from the 7-ray simulations, and hence allow for discrimination, see Fig. 7 of 
for further details. The Hillas-parameter based reconstruction and discrimination have 
been the basis of a large number of discoveries in recent years, showing that these two 
simple scaled parameters were sufficient for an efficient 7/hadron discrimination in case 
of bright sources (i.e., > 10% C.u|§). 

4-2. 3D-model based discriminant parameters 

As described in Sec. [H eight parameters are adjusted by the 3D-model fit, which 
also provides the error estimates for these parameters. Among these, two parameters, 
the reduced 3D-widtl^ and the depth of shower maximum (3D-depth), together with 
the error on the former parameter {3D-width-err) , have been retained for their high 
discriminant power. The distribution for the 3D-width for 7-rays (Fig. ^1^) is narrower 



^ "Crab Units", i.e., compared to a hypothetical source with the Crab nebula intensity and spectrum 
at the zenith angle of observation. 

^ As the Cherenkov photosphere is found to be dependent on the variation of the Cherenkov threshold 
with the altitude of the shower maximum, an efficient and dimensionless 7/hadron discrimination 
parameter called reduced 3D-width has been defined, though for simplicity this is referred to as the 
3D-width in the following. 
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Experimental Image 3D-model Prediction 




Figure 4: 7-ray and hadron images seen in a camera. The upper panels show the simu- 
lated 7-ray cleaned image {left panel) and corresponding prediction given by the 3D-model 
optimization {right panel), while the lower panels show a real cleaned hadron image and its 
corresponding 3D-model prediction. 

and has a peak at lower values compared to that of the background. The same applies to 
the 3D-width-err (Fig. [3]d), showing that the model is not suitable for the representation 
of a hadron-induced shower, thus giving larger error values as a consequence, with an 
associated discrimination power. Moreover, it was found that the two 3D-width related 
parameters show a significant separation performance over the entire energy range, while 
3D-depth (Fig. [Sb) is highly discriminant for lower energies, with decreasing separation 
power with increasing energy, as will be shown also in Sec. 15.31 

4-3. New discriminant parameters 

Beyond the five above-mentioned discriminant parameters, three new ones have been 
defined for the purpose of further enhancing the 7/hadron separation performance. Due 
to the difference in the shower development, especially regarding its rotational asym- 
metry/symmetry around the shower axis, even in the case where the fit of a hadronic 
shower with a 7-ray model (as for instance in fl) succeeds, it may lead to incoheren- 
cies - in both shape and total charge - between the predicted shower images and those 
observed (or absent) in the individual telescopes (see Fig. H]). The alternative method 
proposed here is to explore these differences by defining additional Hillas parameters 
(here called HiUasOnModel parameters) using the predicted images in the cameras, to 
which the same cleaning and moment calculation procedures are applied and which can 
be used to define new discriminant variables. In this way the inter-telescope correlations 
which are inherently included by the 3D-model optimization are further exploited in a 
simple Hillas-parameter based discrimination framework. 
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Figure 5: The O (a), Re (b) and AQ (c) distributions for the energy range 1-2 TeV and for 
the zenith angle range 10-20°. The filled distributions correspond to the 7-ray events, while 
the hatched distributions correspond to the cosmic-ray background. 

4-3.1. The Q parameter 

The discriminant parameter named Q is based on the expected difference in the 
angular distance on the sky between the two reconstructed shower directions when using 
HiUas and the HiUasOnModel ellipses: 



where -OHiUas and "yHiUasOnModei are the normalized vectors corresponding to the two direc- 
tions. The shift in the mean value of the Q distributions (see Fig. [5^) is due to the fact 
that the difference in the major axis of the two ellipses is small for a well- fitted 7-ray 
and larger for a badly-fitted hadron. 

4.3.2. The Re parameter 

The energy reconstruction procedure described in Sec. [3] allows the derivation of the 
following parameter based on the ratio of two reconstructed energy values: 



where the energy of the event, -Emiias; is estimated with the Hillas parameters of the 
detected images, and the additional energy estimation, -EniUasOnModeh is carried out based 
on the HiUasOnModel parameters. This ratio is expected to have small deviations from 
zero for well-reconstructed 7-rays, while for misreconstructed hadrons larger deviations 
are expected. The distributions of Re for simulated 7-rays and for real background are 
illustrated in Fig. [5]d, and these differ as expected, the background having a large tail 
towards smaller Re- 

4.3.3. The AQ parameter 

The third new parameter also takes advantage of the new energy reconstruction 
procedure. Having calculated the event energy and knowing the reconstructed impact 
parameter of the event, the second phase of the procedure for the energy evaluation per 



Q = logio(arCCOs({;Hillas ■ l^HillasOnModel)) 



(4) 



Re = fogio 



-E'HillasOnModol 



(5) 
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telescope described in Sec. [3] can be inverted to find the expected image charges gxeii on 
each telescope. A discriminant parameter based on the deviation of q^eii with respect to 
the measured charges q^eii can be defined using a least squares in charge: 

^Q = 7^Y1 [^Tel, - QTelf (6) 
^tot .^^ 

where A^Teis is the number of telescopes in the array and Qtot is the summed charge 
in all telescopes as defined previously in Eq. |3l For poorly-reconstructed events, an 
inconsistency between the energy evaluation and the reconstructed impact position is 
expected. Hence, the predicted charge gxeii on a telescope for such an event is expected to 
be very different from the gxei^ measured (or not measured, in the case where a telescope 
did not trigger). As shown in Fig. Et, the shape of the AQ distributions reflect the 
expected behaviour, the background events having larger values. 

5. Multivariate analysis 

In order to achieve an efficient hadron rejection, all the above-mentioned parameters 



are used within a multivariate discrimination scheme based on the TMVA [13| package 



within the ROOT [Ij] framework. Among all the available discrimination algorithms, 
this work had initially focused on neural networks {Multi Layer Perceptron or MLP) and 
on Boosted Decision Trees (BDT). During these early tests, it became immediately clear 
that the BDT technique was more stable and gave better results in terms of signal-to- 
background discrimination, thus confirming that this technique is the best out-of-the-box 
multivariate procedure, requiring minor adjustments to be applicable. 

A summary of the operation of the BDT technique is given in Appendix [Al with 
details particularly concerning the design of the decision trees, the difficulties which can 
be encountered through overtraining, and the best pruning strategies to apply for the 
case of lACTs. The multivariate classification technique consists of four main phases: 
the first one is the training (see Appendices lA.ll IA.2p where the "forests" of decision 
trees are constructed, with a set of events randomly chosen from the input. Then, in a 
second phase, the classifier performance is tested with an independent set of events, and 



evaluated (see Appendices IA.31 [A.4p . This study led to a specific design of the decision 



trees, differing somewhat from the default options of the BDT procedure as implemented 
in TMVA. Since these details could be useful for the implementation of this technique 
for other lACTs, they are included in Appendix IA.5I After the definition of the final 
cut values, the classifier is finally applied to the data (see Appendix IA.6|) . 



5.1. Simulated 'j-ray and measured hadron background data samples 

The 7-ray shower samples were produced by MC simulations with KaskadeQ, 



la, [ITJ. The 7-ray showers are simulated between 30 GeV and 120 TeV for two different 



azimuths {North and South), several zenith angle bands spaced regularly in cos(^zcn) (in 



^One of the two shower simulation packages used in H.E.S.S., the other being Corsika 18 1 
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Figure 6: Left panel: number of fitted photons in the reconstructed shower (nphot) as a 
function of the reconstructed energy in TeV (log scales). Right panel: the definition of the 
nphot bins and the approximate corresponding values (based on the left-hand figure) of the 
reconstructed energy (in TeV) are illustrated. 



steps of 0.05) from 0° to 70°, and for a point-like source. The resulting Cherenkov light 
is then propagated through the H.E.S.S. detector with Smasi^, ^6|, where the source 
position can be shifted to different offsets D from the centre of the camera (six D values 
spaced by 0.5° from 0° to 2.5° are simulated, where 2.5° corresponds to the radius of the 
H.E.S.S. field-of-view or FoV). Taking advantage of the large set of available H.E.S.S. 
data as a background sample, the use of time-consuming and marginally-reliable hadron 
simulations could be avoided. The background was therefore extracted from extragalac- 
tic observations using regions of the camera which do not contain any known 7-ray 
source, at various zenith angles, and taking into account the northern or southern az- 
imuth of the observations (to account for different geomagnetic field effects). All the 
hadrons reconstructed out to angles including the H.E.S.S. FoV plus 0.2° are taken into 
account. 

5.2. Training and Test Configurations 

A set of bins in energy, zenith angle, and telescope multiplicity (i.e., the number of 
triggering telescopes; two to four in the case of H.E.S.S.) are defined for the training/test 
phases of each of the implemented analysis configurations (see Sec. 15.41) . 

There are eight zenith angle bins and seven energy bins, in order to regroup the 
different classes of events showing similar properties. The shape of the parameter distri- 
butions for each defined group may differ from the other groups, leading to a different 
background rejection performance. The zenith angle bins are defined in 10° intervals 



This program is one of the two available H.E.S.S. detector-response simulation programs. 
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from 0° up to 40°, then in 5° intervals, and finally in a single interval^ from 55° to 90° 
for each of which seven energy bins are defined by the number of photons in the shower 
as fitted by the 3D-model (referred as to nphot), as shown in Fig. El The latter choice 
has been found to be effective in avoiding brusque variations in the final effective area. 

The phase space is additionally subdivided in two bins in telescope multiplicity: one 
for the two-telescope events and one for the three and four-telescope events, leading 
to a total of 112 bins altogether. This split has been motivated by the fact that for 
events which trigger only two telescopes, less information is available for geometric 
reconstruction, therefore leading to a parameter space which is considerably different 
with respect to that for events triggering three and four-telescopes. It should be noted 
that in order to avoid badly-reconstructed 7-ray events polluting the training and test 
phases, such events are excluded using a cut on the angular distance on the sky between 
the reconstructed event direction with respect to the source direction (< 0.11°). 

5.3. Relative importance of the parameters for discrimination 

The multivariate method ranks the discriminant parameters according to the fre- 
quency with which they are used in the splitting of the decision tree nodes. The values 
shown in the boxes of Fig. [7^ are given by the sum of the relative weights given by the 
BDT training. It can be observed that at low energies {E < 500 GeV) the most impor- 
tant discriminant parameter is the SD-model estimated depth of the shower maximum, 
with its importance decreasing as a function of energy. The second parameter in order 
of importance is Q: while its discrimination performance is clearly high in all energy 
bins, it is more powerful at low energie^ {E < 500 GeV, bin in nphot < 3) or high 
energies {E > 5 TeV, bin in nphot > 7). The least discriminant parameter in almost 
all cases is MSCL. The importance of the chosen parameters can also be evaluated by 
checking the relative quality factor when progressively adding groups of variables to the 
training, where the quality factor is defined as 

Q = where e^,bkg = A^cut/A^reco (7) 

V^bkg 

with Ncut, Nj-eco being the number of events after and before cuts, respectively, in a given 
bin. Three different test configurations are defined: the first group includes only MSCL 
and MSCW; the second comprises MSCL, MSCW plus the group of the three chosen 
3D-model parameters; and the third adds the three new ones defined for the purpose of 
this work to the preceding groups. The results are given in Fig. [7)d: a significant gain 
in performance is achieved when adding the group of the three 3D-model parameters to 
the Hillas-parameter based ones; and with the addition of the new parameter group a 
further ~ 20% of background is cut away, leading to an additional gain in Q of ~ 12%. 



^Note that the definition of zenith angle bins which are regularly spaced as a function of cos(0zcn) 
as used in the MC simulations is more convenient; however, the different choice used here for the 
MVA training has been imposed by some initial compatibility constraints with other modules inside 
the H.E.S.S. Analysis Package. 

^"^Though fl is still less discriminant than the depth at shower maximum. 
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Figure 7: (a) Two-dimensional plot of the importance of the parameters during training as 
a function of the bin in nphot. The value shown in each box is the sum over all zenith angle 
bins of the relative weights, given by the BDT training to each discriminant parameter (see 
text). Depth, ErrW and RedW refer to the 3D-depth, the 3D- width-err and the 3D- width, 
respectively (b) Quality factor (QF) as a function of the nphot bin for given 7-ray efficien- 
cies, at a zenith angle of 0° and for a configuration having a charge threshold of 80 p.e. per 
telescope. Dotted, dashed and continuous lines represent the QF obtained with the following 
configurations of parameters: a first group (MSCL, MSCW), a second group (adding three 
3D-model parameters), and a third group (adding the three new newly defined parameters), 
respectively. 



Finally, it should be noted that investigation of the effects of the correlations between 
the discriminant parameters resulted in the elimination of a few parameters which had 



been defined at an early stage of the analysis development (see [19|). Optimization tests 
within the specific framework used here showed that the inclusion of pairs of highly- 
correlated variables resulted in a loss of discrimination performance, probably due to 
the overall development of the trees in the forest being more sensitive to fluctuations!^ 

5.4- Definition of cuts for different source types 

The training on each event class described in Sec. 15.21 provides a separate BDT clas- 
sifler (see Fig. [8] for two examples), on which the analysis cut is based. For the deflnition 
of the flnal cut values, the use of the highest signiflcance criterioiiEl provided by the BDT 
training process was found to be subject to statistical fluctuations. For this reason, 7-ray 
efficiency profiles as a function of energy were defined in order to optimize the detection 



^^Tests of the TMVA functionality which implements parameter decorrelation provided inconclusive 
results, though this may be explored in further developments. 

^^Which depends on the relative normalization between signal and background. 
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Figure 8: The BDT classifier response to the BDT training and test phases for the bin 
between 10-20° in zenith, and between 1-2 TeV for two-telescope events (a) and for three- 
telescope and four-telescope events (b). The response to both the training and test samples 
are shown (points ans histograms respectively). 



sensitivity for sources with different fluxes, through an iterative optimization process. 
For illustration, the pre-defined 7-ray and corresponding background efficiencies for one 
analysis configuration, of the five described below, is shown in Fig. [HI For the purpose of 
the analysis of sources showing different spectral indices, five configurations are defined 
based on the following charge thresholds: 40, 60, 80, 110 and 150 p.e. 

To achieve an optimal sensitivity for sources with spectral indice^ T < 3 and fluxes 
above ~ 1% C.U., an analysis configuration having a charge threshold of 80 p.e. (and 
a similar one at 60 p.e.) has been developed. This configuration is therefore adapted 
for most Galactic-plane observations, where most of the sources discovered have hard 
spectra (see [2l|) and is also used for low-redshift extragalactic observations. For similar 
spectral indices but lower fluxes (< 1% C.U.), another configuration has been defined 
with a charge threshold of 110 p.e. and for which the 7-ray efficiencies have been set so 
as to yield a lower rate of low-energy two-telescope events than the other configurations, 
see Fig. [91 These two characteristics help in reducing the lower energy fluctuations 
due to the background events, thus enhancing the sensitivity for this source class. For 
morphological studies of extended sources, a dedicated configuration has been developed 
which has an enhanced PSF, optimizing the performance in geometrical reconstruction 
by raising the charge threshold to 150 p.e. 

For high-redshift extragalactic observations where the effect of 7-ray absorption be- 
comes significant, steep-spectrum (F ~ 3) sources are naturally expected. To optimize 
the sensitivity for the detection of this class of sources, the image size threshold has been 
lowered to 40 p.e. in order to enhance the signal of the abundant lower energy events. 

App. |B] discusses the instrument response functions associated with these configura- 
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Figure 9: Discrimination performance for the analysis configuration having a 110 p.e. charge 
threshold. The solid lines show the 7-ray efficiencies within the angular pre-cut mentioned in 
Par. 15.21 while the dashed lines show the corresponding hadron efficiencies multiplied by 10. 
These are shown as a function of the nphot bin for a zenith angle of 0°, (a) for two-telescope 
events and (b) for three and four-telescope events. The definition of the photon bins (nphot) 
can be found in Fig. [6l 

tions. 

6. Robustness and consistency checks 

6.1. Robustness with respect to variations of the NSB rate 

One of the key points of the present approach is its ease of implementation without 
the need for cahbration of the discriminant variables as a function of the NSB rate for a 
given FoV. With the H.E.S.S. telescopes, the NSB median value of the latter can vary 
from few tens of MHz per pixel for extragalactic observations, up to ~ 130 MHz for 95% 
of H.E.S.S. observations and for central parts of the Galactic plane. Fig. [TOb shows the 
distribution of the BDT response for simulated 7-rays following a power-law (F = 2) at 
Zenith, with different NSB rates ranging from 40 to 200 MHz. The stability of the BDT 
response to NSB rates up to 150 MHz is remarkable: the relative migration of events 
from the signal to the background range remains well below 2%. For a larger NSB rate 
of 200 MHz, the migration attains ~10%, however, the resulting systematic error on 
the effective area (i.e., 10%) remains dominated by the error due to the uncertainty 
in the energy scale jsj, depending on the spectral index of the source. A check of the 
energy dependence of such an effect has also been carried out through reconstruction of 
simulated 7-rays following the same power-law and the same NSB levels as above. No 
significant bias was found on the measured spectral index. This was expected, since the 
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Figure 10: (a) BDT response for simulated 7-rays for different NSB rates ranging from 40 to 
200 MHz. Tlie sample was simulated at Zenith and follows a power-law with differential index 
r = 2. The BDT response remains stable for NSB rates up to 150 MHz, the relative migration 
of events from the signal to the background range remaining well below 2%. For a larger rate 
of 200 MHz, the latter attains ~10%, without significant impact on the measurements (see 
text), (b) BDT response for 7-ray simulated events (solid line) and real 7-ray events (crosses 
with error bars) from the Crab nebula. 




Figure 11: Reconstructed spectrum of the Crab nebula (points), showing an excellent agree- 
ment within the systematic errors with the published results (dashed line)0]. The residuals 
are calculated with respect to the fitted spectral shape. 
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choice of those variables aheady available (MSCL, MSCW, 3D-width, 3D-width-err, 3D- 
depth, see Sec. 14.11 and l4.2p . and the definition of the new discriminant parameters {Q, 
Re and AQ, see section was made following the study of their individual robustness 
with respect to the NSB rate. 

6.2. Consistency checks using real'y-ray data 

The consistency of the BDT response was checked by comparison of its distribution 
for simulated 7-rays to that of the signal from the Crab nebula. MC simulations were 
made for 7-rays with a spectrum following a power-law up to 100 TeV with a differential 
photon index T = 2.6, and a zenith angle of 46°, matching closely the chosen data-set 1^ 
As can be seen on Fig. [TUb . there is a good agreement between the simulated sample's 
BDT distribution and that of the Crab nebula signal. 

Finally, as is mandatory in VHE 7-ray astronomy, an overall consistency check of the 
analysis chain was performed by comparing the spectrum as measured with the present 
MVA approach to that obtained with the standard Hillas-parameter based one, for a 
source such as the Crab nebula . As shown in Fig. [TTl fitting an exponentially cut-off 



power-law to the data in the range 380 GeV to 44 TeV using the method in [20j yields a 
fiuxof/o = (4.31±0.07stat)xlO"i^cm-2s-iTeV"\ an index of F = 2.43±0.03, and a cut- 
off energy Ec = 15.4 ± 2.2 TeV. These values are fully compatible with those published, 
evaluated in the range 410 GeV to 40 TeV: Jq = (3.76 iO.OT^tat) x lO'^^cm-^ TeV"\ 
F = 2.39 ± 0.03, and Ec = 14.3 ± 2.1 TeV, noting that the data-set used here is twice as 
large as that in [3| and that the difference (~15%) in the differential flux is well within 
the systematic error of the lACT measurements. The same type of check made on other 
established sources yielded an excellent level of compatibility, further reinforcing the 
confldence in the approach presented here. 

7. Performance with published H.E.S.S. sources 

The performance of the analysis scheme proposed here has been tested on a set of faint 
H.E.S.S. sources. The spectral index is used to select, a priori, two test conflgurations 
which are expected to be the best adapted. The results obtained for a sample of five 
cases are presented in Tab. 1 and compared to the standard analysis (labelled as Hillas), 
so as to illustrate the trends as a function of the sources' spectral indices. 

There are four extragalactic sources ordered by spectral index - the three blazars 
PC 1553 + 113, lES 0347-121 and H 2356-309, and the radio-galaxy Cen A - together 
with one hard-spectrum Galactic source, the composite supernova remnant GO. 9 + 0.1 
(references are given in the table). 

The main trend which can be seen is that the best-adapted MVA configuration for 
the source with the softest spectrum (PC 1553 + 113) is the one with the lowest charge 
threshold (i.e., 40 p.e.), whereas the 110 p.e. configuration yields the best performance 
for the hardest spectrum source (GO. 9 + 0.1). The 80 p.e. and 40 p.e. configurations are 
somewhat equivalent for sources with spectral indices F ~ 3, a value which appears to 



^^The NSB simulated rate is 100 MHz per pixel, close to the median value measured on the Crab 
nebula FoV as seen by H.E.S.S. 
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Table 1: Performance of the analysis on some published H.E.S.S. sources. For each, published 
results obtained with Hillas-parameter based analyses are shown in the first line and results 
obtained here with the MVA method presented in this paper are shown just below. F, (jjQ and 
are the published differential spectral index and flux in C.U., ^zen is the mean zenith angle 
of observation in degrees, "LT" is the livetime in hours, "On" represents the total number of 
events around the position of the source in the sky, "Off" represents the normalised number of 
background events, is the number of excess events. No- is the significance of the detection 
and (j/Vh is the significance over the square root of the observation livetime in hours, for 
which the best-adapted configuration is shown in bold. The normalisation factor (a in Eqn. 
17 of [i^l) depends on the source observation conditions (the offset, the angular cut between 
the reconstructed direction and the source position, and the disposition of excluded regions in 
the FoV), and varies from 0.06-0.125 for the sample shown. 



constitute a pivot index above which the use of the lowest charge threshold configuration 
is the best adapted. 

In more detail, indeed, the MVA 40 p.e. and 80 p.e. configurations give a similar gain 
in sensitivity over the standard Hillas-parameter based analysis for the two blazars for 
which r ~ 3 (by a factor of about 1.2 for both configurations for lES 0347 — 121, and 
1.5-1.6 for H 2356-309). In contrast, for PG 1553 + 113, MVA at 80 p.e. is significantly 
less efficient than that at 40 p.e0 On the other hand, for Cen A, which shows a harder 
spectrum (F ~ 2.7) than the other extragalactic sources, the MVA 80 p.e. configuration 
becomes dominant over the 40 p.e. one, with a clear gain in sensitivity as compared to 
the standard Hillas analysis by a factor of 1.8 (versus a factor 1.2 for the 40 p.e. case). 

With a gain in sensitivity of the MVA approach over the Hillas-parameter based 



^^For the very soft spectrum source PG 1553 + 113, the reference in the table also applies specific 
Hillas soft spectrum cuts with a 40 p.e. threshold, similarly to the best MVA configuration here. 
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analysis ranging from a factor 1.2 to 1.8 for the sample of sources shown here, this 
corresponds in a gain in observation time by a factor 1.4 to 3.2, as can be seen from the 
column a/ a/E of Tab. 1. 



8. Conclusions 

A new analysis method is presented, based on an MVA approach optimized for the 
detection of low-flux sources, conceived so as to be easily and quickly adaptable to other 
current and future lACTs. 

Three new discriminant variables are combined with the already-known Hillas-param- 
eter based variables mean-scaled image width and length (MSCL, MSCW), and with 
carefully-selected parameters from the more recent 3D-model (the 3D-width of the 
Cherenkov photosphere, its error, and the 3D-depth of shower maximum) in a multi- 
variate discrimination procedure using Boosted Decision Trees, where this analysis has 
been adapted to the particularities of detection with lACTs. Two of the new discrimi- 
nant parameters - one of which uses a new energy reconstruction method described here 
- are based on the Hillas moments calculated from the 3D-model predicted images in the 
different telescopes, which provides a means to take into account the inter-telescope cor- 
relations which are missing from a simple Hillas-parameter based discrimination frame- 
work. For the third new discriminant parameter, the energy evaluation method is in- 
verted to predict the expected image charges and compare them in a least squares with 
the images measured in each telescope. 

A number of BDT classifiers are defined, in order to take account of the different 
classes of data, with bins in zenith angle, telescope-multiplicity and reconstructed en- 
ergy. Note that the number of photons in the Cherenkov photosphere as fitted by the 
3D-model is used as a proxy for the reconstructed energy as this is found to be effec- 
tive in avoiding brusque variations of the final effective area. The robustness of the 
new analysis method is shown with respect to the variations in NSB for MC simulated 
7-rays by examining the evolution of the BDT response. The consistency is checked by 
comparing the BDT response with that for the real 7-ray excess from the Crab nebula 
as seen by H.E.S.S. A final consistency check is performed by comparing the spectrum 
from the Crab nebula as derived by the present MVA approach to that published by 
H.E.S.S. using the Hillas-parameter based analysis, and the two spectra are shown to 
have an excellent compatibility. The performance of the different groups of parameters 
when applied to MC simulated 7-rays and real background data is presented, showing 
the relative gains achieved in each case above the standard Hillas-parameter based anal- 
ysis. For the application to the analysis of real sources, several configurations have been 
defined so as to allow cut optimization for different expected source characteristics (in- 
tensity and spectral hardness), as well as for studies of source morphology. The method 
is apphed to a set of known H.E.S.S. sources with measured characteristics. The gain 
in sensitivity (1.2 to 1.8) is in conformance with the expectation based on their spectra 
and intensities. 

The method is currently being applied successfully to H.E.S.S. observations, con- 



tributing to discovery of new faint sources, see e.g., [28[. The overall development of 



this new MVA approach presented here has been carried out in order to be easily adapted 
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and applied to other current and future stereoscopic arrays - such as VERITAS, and 
in the near future the HESS-II upgrade (in the stereo energy regime) - this ease of ap- 
phcation being achieved thanks to the rapidity and flexibihty of the method presented. 
These features have been recently demonstrated through successful application for the 
future Cherenkov Telescope Array (CTA) project. 

Future improvements of the method include a deeper study of the parameter decor- 
relation prior to the use of the MVA training procedure, as mentioned in Sec. 15.31 and 
a further investigation on the possibility to mix the two reconstructed directions (the 
Hillas-parameter based and the 3D-model ones) as a function of an observable related to 
the energy (see Sec. |2]). The study of the stability and performance of the method as a 
function of a degraded performance of the telescopes through MC simulations constitutes 
also an interesting point to be investigated in a future work. 

A. Decision tree generalities and design 
A.l. Training phase 

The training phase of a multivariate background rejection method consists of the 
construction of a decision tree (see Fig. [T2b) given a pre-defined specific binary tree 
architecture. The separation of the events into signal and background is carried out 
starting from a root node using the parameter giving the best separation performance 
and according to a chosen splitting criterion. The decision tree continues to grow until 
its pre-defined maximum size has been reached or a minimum number of events are 
present in the node. 

A. 2. Boosting 

A single decision tree is, however, unstable with respect to statistical fluctuations, so, 
once the first tree is grown, the same procedure is repeated in order to build an ensemble 
of decision trees (called a forest) by reweighting the same sample of events (this step 
is called the BDT boosting). The boosting has the important feature of contributing 
greatly to the statistical stability of the algorithm and further improving the separation 
performance. 

A. 3. Overtraining and pruning 

A key point during the training phase is the need to avoid the so-called overtraining 
of the multivariate procedure: the overtraining phenomenon occurs when the separation 
is too powerful for the training sample, generally resulting in the overestimate of the 
real discrimination power. If overtraining occurs, the multivariate method is then not 
capable of learning the real trend in the data, but simply memorizes the data by heart 
resulting in a poor generalization of the problem. When in presence of overtraining, 
which leads to a too-good fit to the training data, the overall performance of the method 
on an independent set of data is generally bad. So, in order to maximize the separation 
performance algorithm and also to avoid keeping statistically insignificant nodes during 
the decision process, a technique known as pruning is apphed to the tree by cutting 
back the poorly populated nodes from the bottom part where the tree has reached its 
maximum size, up to the root node. 
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Figure 12: Left panel: Schematic view of a decision tree. Starting from the root node, an 
example of a possible tree development is illustrated. A sequence of binary splits using the 
defined discriminant variables is applied to the data. At each node, a decision between signal 
and background is carried out using the variable giving the best separation performance. See 
[lit further details. Right panel: Parameters used to construct the architecture of the 
Boosted Decision Trees applied for the discrimination procedure presented. 



A. 4- Classifier output 

At a final stage of the BDT procedure, all the trees are combined together resulting in 
a single classifier output given by the average (or weighted) behaviour of the individual 
decision trees. The classifier resulting from the training phase is tested to evaluate the 
global performance of the multivariate method and this is done during the test phase. 
During the testing, an independent set of events is used as input for the sphtting proce- 
dure so that the training classifier distribution can be compared with that resulting from 
the test. The comparison of the two classifiers is usually evaluated by a Kolmogorov test 
on the shape of the two resulting distributions. As shown in Fig. [HI the Kolmogorov test 
value is an important check on the reliability of the signal-to-background classification 
behaviour. If the statistical test of compatibility fails, the reliability of the analysis 
results will not be guaranteed. 

A. 5. Design of the decision tree 

The result of the tuning of the values defining the tree architecture in our specific 
parameter environment, is summarized in the table shown in Fig. [T2b . The tests car- 
ried out by the authors showed that the discrimination performance was enhanced when 
allowing the tree to develop quite deeply and then cutting back the branches with an 
optimal PruneStrength. For this reason, in this analysis the splitting can be carried out 
up to a maximum tree depth of 20 node layers (MaxDepth), however, the sequence of 
event separation splits is stopped once a minimum number of 30 events has been reached 



22 



in a node {nEventsMin) . Having fixed all the other training parameters, the discrimina- 
tion performance can vary substantially depending on the value of PruneStrength. Tests 
have shown that the optimal choice was to set this value to —1, giving the possibility 
to use an algorithm which tries to detect the optimal strength of pruning for the given 
problem. 

Every bin in this analysis is a separate entity having its own number of signal and 

background events, and a specific approach has been used for the calculation of the 
statistics available for each bin and for the choice of the number of trees composing the 
different forests. Given a total number of background and signal events {Ngig, -/Vbkg) 
available for a specific bin, the number of events to be used during the training phase 
-Strain IS Calculated in the following way: 

-/Vtrain = min(ntram, 1-2 X 10^); where ntrain = min(0.6 x Nsig, 0.6 x A^bkg)- (8) 

The resulting number of events for the different bins can vary considerably due to thresh- 
old effects as a function of the zenith angle and due to the available simulations and real 
observations. In the 112 bins, A^train can vary from 2 ■ 10'^ to 3 ■ 10''. Finally, the number 
of events during the test phase is calculated by the remaining set of events after iVtrain 
subtraction by: 

ntest = min(A^sig - A^train, A^bkg - A^train); A^'test = min(ntest, 1-2 X 10^) (9) 

During the development of this analysis method, it has been observed that an insta- 
bility in performance occurs when the number of requested trees in the forest is high and 
the number of events present in the sample is low. For this reason, the number of trees 
composing the forest has been adjusted as a function of the available A^train statistics 
present in each bin as: 

^trees — 

100 X (ATtraln/lO^); with ATtrees 

= min(max(ntrees, 10), 200); (10) 
leading to forests composed of 10 to 200 decision trees. 

A. 6. Cut application during analysis 

The final step of the multivariate procedure is the application of the cuts during the 
data analysis using the BDT classifier adapted for the mean zenith angle of each run. 
For each event in the run, the values of the discriminant parameters are evaluated, and, 
according to the number of photons in the shower as fitted by the 3D-model and the 
telescope multiplicity of the event, the weights and the cut value of the corresponding 
training bin can be identified straightforwardly. For events whose parameters correspond 
to a bin for which not many simulated events were available for the training/test phases 
(which is the case for instance for the events whose energy is well below the threshold 
for a given zenith angle) , no discrimination is possible and the event is classified as being 
background. 
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Figure 13: For the two different analysis cuts (40 p.e. and 150 p.e., dashed and continuous 
hue respectively, see Sec. 15. 4p . (a) Point-spread functions defined as the containment radius 
at 68% of all the events passing the multivariate cuts as a function of the observation zenith 
angle for 0.5° offset, (b) Effective area of events passing the multivariate cut and the point-like 
source cut defined in Sec. [B]for a zenith angle of 18°. 

B. Point-spread functions and effective areas 

The response of the instrument to 7-ray point sources, or 7-ray point-spread function 
(PSF), is usually characterised by the 68% containment radius. It is important that this 
be small in order to allow the background to be better rejected. The 68% containment 
radii (see Fig. [T3k ) remain almost constant up to zenith angles of 40°, so for the eval- 
uation of the final cuts for point-like sources the mean values in the range 0°-40° for 
the analysis configuration under consideration are taken, and, given that the systematic 
error of the pointing accuracy is 0.005°, slightly larger values are applied as cuts for the 
final analysis: a good compromise is found with 0.11° at 40 p.e., 0.10° at 80 p.e. (and 
60 p.e.) and finally 0.09° at 110 p.e. The 150 p.e. configuration has been conceived for 
source morphology studies where a very good PSF is requested. As shown in Fig. [T3k . 
thanks to the charge threshold and the chosen 7-ray efficiencies, the 68% containment 
radius for this latter configuration reaches 0.075°. The effective detection areas in the 
case of a point-like source for the analysis configurations, evaluated after this additional 
angular cut, are shown in Fig.[T3b. As expected from the different minimal charge values 
for the Hillas-parameter based reconstruction, the configurations have different energy 
thresholds and different integrated effective areas. 

When setting the charge threshold at 40 p.e., a mean gain in detection threshold 
of a factor of about 2 with respect to the 150 p.e. charge threshold is seen, as can be 
deduced from effective areas in Fig. [T3b . which is extremely important for the study of 
sources with soft spectra. On the other hand, when a higher charge threshold is set, the 
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effective area is smaller, with the compensation of a better PSF. 
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